An Improved Algorithm for Performance Optimal Technology Mapping with Retiming in LUT-Based FPGA Design

نویسنده

  • Jason Cong
چکیده

The FPGA technology mapping and synthesis problem for combinational circuits has been well studied. But for sequential circuits, most of the previous synthesis and mapping algorithms assume that the positions of ippops are xed and synthesize each combinational block independently. Retiming is a technique to reduce the clock period by repositioning ippops LeSa91]. With retiming, the previous mapping algorithms may not lead to an optimal solution of a sequential circuit by mapping each combinational block independently. Pan and Liu PaLi96b] recently proposed a novel algorithm, named SeqMapII, of technology mapping with retiming for the optimal clock period. The time complexity of their algorithm, however, is O(K 3 n 4 log(Kn 2) log n) for sequential circuits with n gates and targeting K-LUT based FPGAs, which is too high for medium and large size designs in practice (more than 20 hours of CPU time to get the optimal solution for a design of 134 gates on a SPARC10 workstation). In this paper, we present three strategies to improve the performance of the SeqMapII algorithm. First, we show that the monotone property of labels hold for sequential circuits and develop an eecient label update with single K-cut computation. Second, we propose a new K-cut computation method on partial ow networks, which are very small in practice. Finally, SCC (strongly connected component) partition is performed to eliminate many redundant label computation to further speedup the algorithm. In practice, our algorithm works in O(K 2 n 3 logn) time and O(Kn) space according to our experimental results. It is 210 4 times faster than SeqMapII-opt for computing optimal solutions and 2 times faster than SeqMapII-heu which uses very small expanded circuits as a heuristic.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

An Improved Algorithm for Performance Optimal Technology Mapping with Retiming in LUT-Based FPGA Desig

A novel algorithm, named SeqMapII, of technology mapping with retiming for optimal clock period for K-LUT based FPGAs was recently proposed by Pan and Liu 13]. The time complexity of their algorithm, however , is O(K 3 n 4 log(Kn 2) logn) for sequential circuits with n gates, which is too high for medium and large size designs in practice. In this paper, we present three strategies to improve t...

متن کامل

An Eecient Algorithm for Performance-optimal Fpga Technology Mapping with Retiming

It is known that most FPGA mapping algorithms consider only combinational circuits. Pan and Liu 22] recently proposed a novel algorithm, named SeqMapII, of technology mapping with retiming for clock period minimization. Their algorithm, however, requires O(K 3 n 5 log(Kn 2)log n) runtime and O(K 2 n 2) space for sequential circuits with n gates. In practice, these requirement are too high for t...

متن کامل

An Optimal Performance-driven Technology Mapping Algorithm for Lut-based Fpgas under Arbitrary Net-delay Models

The field programmable gate-array (FPGA) has become an important technology in VLSI ASIC designs. Most existing performance-driven technology mapping algorithms for Lookup-table (LUT) based FPGA designs are based on unit delay model. In this paper we present an efficient algorithm which finds an optimal technology mapping solution with minimum delay under arbitrary net delay models for LUTbased...

متن کامل

FlowMap: an optimal technology mapping algorithm for delay optimization in lookup-table based FPGA designs

This paper represents pioneering work in the area of technology mapping for FPGAs and presents a theoretical breakthrough by solving the LUT-­‐based FPGA technology mapping problem for depth minimization optimally in polynomial time. The key idea was to model the LUT mapping problem as the computation of a minimum-­‐height, K-­‐feasible cut. This was the first global formulation of the LUT mapp...

متن کامل

An Efficient LUT Design on FPGA for Memory-Based Multiplication

An efficient Lookup Table (LUT) design for memory-based multiplier is proposed.  This multiplier can be preferred in DSP computation where one of the inputs, which is filter coefficient to the multiplier, is fixed. In this design, all possible product terms of input multiplicand with the fixed coefficient are stored directly in memory. In contrast to an earlier proposition Odd Multiple Storage ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1996